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(2) INFORMATION FOR SEQ ID NO : 1: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 888 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTI -SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Actinia equina 

(ix) FEATURE: 

(A) NAME/KEY: mat_peptide 

(B) LOCATION: 99. .695 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 3 . .695 

(ix) FEATURE: 

(A) NAME/KEY: sigjeptide 

(B) LOCATION: 3 . .98 

(ix) FEATTOE: 

(A) NAME/KEY: 5 ' UTR 

(B) LOCATION:!. .2 

(ix) FEATURE: 

(A) NAME/ KEY : 3 ' UTR 

(B) LOCATION: 696. .888 
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(F) PAGES: 549-560 

(G) DATE: 1998 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1: 



CT ATG OCT CTT AGO CAA AAC CAA GCC AAG TTT TCC AAA GGA TTC GTC 
Met Ala Leu Ser Gin Asn Gin Ala Lys Phe Ser Lys Gly Phe Val 
-32 -30 -25 -20 



47 



GTG ATG ATT TGG GTA CTA TTC ATT GCT TGT GCT ATA ACT TCA ACT GAA 
Val Met lie Trp Val Leu Phe lie Ala Cys Ala lie Thr Ser Thr Glu 
-15 -10 -5 



95 



GCT AGT CTA ACC AAA TGC CAA CAG CTC CAG GCC TCG GCT AAC AGT GGT 
Ala Ser Leu Thr Lys Cys Gin Gin Leu Gin Ala Ser Ala Asn Ser Gly 
15 10 15 



143 



CTG ATA GGT ACT TAT GTA CCA CAA TGC AAA GAA ACG GGA GAG TTC GAA 
Leu He Gly Thr Tyr Val Pro Gin Cys Lys Glu Thr Gly Glu Phe Glu 
20 25 30 



191 



GAA AAA CAA TGC TGG GGA TCG ACT GGT TAG TGT TGG TGT GTG GAT GAA 
Glu Lys Gin Cys Trp Gly Ser Thr Gly Tyr Cys Trp Cys Val Asp Glu 
35 40 45 



239 



GAT GGA AAA GAG ATT CTA GGA ACC AAG ATC CGT GGA TCT CCG GAT TGC 
Asp Gly Lys Glu He Leu Gly Thr Lys He Arg Gly Ser Pro Asp Cys 
50 55 60 



287 



AGC CGC AGA AAA GCC GCG TTA ACA CTT TGC CAG ATG ATG CAA GCC ATC 
Ser Arg Arg Lys Ala Ala Leu Thr Leu Cys Gin Met Met Gin Ala He 
65 70 75 



335 



ATT GTT AAT GTC CCT GGT TGG TGT GGC CCT CCA TCG TGT AAA GCT GAC 
He Val Asn Val Pro Gly Trp Cys Gly Pro Pro Ser Cys Lys Ala Asp 
80 85 90 95 



383 



GGC AGT TTT GAC GAG GTT CAG TGC TGC GCA AGT AAT GGA GAA TGC TAC 
Gly Ser Phe Asp Glu Val Gin Cys Cys Ala Ser Asn Gly Glu Cys Tyr 
100 105 110 



431 



TGT GTG GAT AAG AAA GGA AAA GAA CTT GAA GGC ACA AGA CAA CAG GGA 
Cys Val Asp Lys Lys Gly Lys Glu Leu Glu Gly Thr Arg Gin Gin Gly 
115 120 125 



479 



AGG CCA ACC TGC GAA AGA CAC CTA AGC GAA TGC GAG GAA GCT CGA ATC 
Arg Pro Thr Cys Glu Arg His Leu Ser Glu Cys Glu Glu Ala Arg He 
130 135 140 



527 



AAG GCG CAT TCA AAC AGT CTT CGT GTT GAG ATG TTC GTG CCA GAG TGT 
Lys Ala His Ser Asn Ser Leu Arg Val Glu Met Phe Val Pro Glu Cys 
145 150 155 



575 



TTA GAA GAT GGA TCA TAT AAC CCA GTA CAG TGC TGG CCT AGC ACA GGA 
Leu Glu Asp Gly Ser Tyr Asn Pro Val Gin Cys Trp Pro Ser Thr Gly 
160 165 170 175 



623 



TAC TGT TGG TGC GTC GAT GAA GGA GGG GTA AAG GTA CCA GGT TCC GAT 
Tyr Cys Trp Cys Val Asp Glu Gly Gly Val Lys Val Pro Gly Ser Asp 



671 




180 



185 



190 



GTC AGA TTT AAA CGC CCC ACA TGC TAAGAAAAAC ACAGTGAACA AAGTGGCTAG 
Val Arg Phe Lys Arg Pro Thr Cys 
195 



725 



TTTCCAGATC GAAAATAACT ACAAAGGATT AATAAAATGT TAAAATAATT TCTCAATTCG 



785 



GCTGTGATAT ATTTTTTCCA AGATAATTTA ATCTGCATGT AGTTAACAGA AAACAATCTC 



845 



AACTAGAAAT AAAGACTACG GTAATAATGA CAAAAAAAAA AAA 



888 



(2) INFORMATION FOR SEQ ID NO : 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 231 amino acids 

(B) TYPE: amino acid 
CD) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

Met Ala Leu Ser Gin Asn Gin Ala Lys Phe Ser Lys Gly Phe Val Val 
-32 -30 -25 -20 

Met He Trp Val Leu Phe He Ala Cys Ala He Thr Ser Thr Glu Ala 
-15 -10 -5 

Ser Leu Thr Lys Cys Gin Gin Leu Gin Ala Ser Ala Asn Ser Gly Leu 
15 10 15 

He Gly Thr Tyr Val Pro Gin Cys Lys Glu Thr Gly Glu Phe Glu Glu 
20 25 30 

Lys Gin Cys Trp Gly Ser Thr Gly Tyr Cys Trp Cys Val Asp Glu Asp 
35 40 45 

Gly Lys Glu He Leu Gly Thr Lys He Arg Gly Ser Pro Asp Cys Ser 
50 55 60 

Arg Arg Lys Ala Ala Leu Thr Leu Cys Gin Met Met Gin Ala He He 
65 70 75 80 

Val Asn Val Pro Gly Trp Cys Gly Pro Pro Ser Cys Lys Ala Asp Gly 
85 90 95 

Ser Phe Asp Glu Val Gin Cys Cys Ala Ser Asn Gly Glu Cys Tyr Cys 
100 105 110 

Val Asp Lys Lys Gly Lys Glu Leu Glu Gly Thr Arg Gin Gin Gly Arg 
115 120 125 

Pro Thr Cys Glu Arg His Leu Ser Glu Cys Glu Glu Ala Arg He Lys 
130 135 140 



Ala His Ser Asn Ser Leu Arg Val Glu Met Phe Val Pro Glu Cys Leu 



145 150 155 160 

Glu Asp Gly Ser Tyr Asn Pro Val Gin Cys Trp Pro Ser Thr Gly Tyr 
165 170 175 

Cys Trp Cys Val Asp Glu Gly Gly Val Lys Val Pro Gly Ser Asp Val 
180 185 190 

Arg Phe Lys Arg Pro Thr Cys 
195 

(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 696 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(iv) ANTI- SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Actinia equina 

(vii) IMMEDIATE SOURCE: 

(B) CLONE: optimized gene for expression in plants 

(ix) FEATURE: • 

(A) NAME /KEY: CDS 

(B) LOCATION:!. .693 

(ix) FEATURE: 

(A) NAME/KEY: mat_peptide 

(B) LOCATION: 97 . .693 

(ix) FEATURE: 

(A) NAME/KEY: sig_j)eptide 

(B) LOCATION:!. .693 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 3: 

ATG GCT CTT AGC CAG AAC CAG GCC AAG TTT TCC AAG GGA TTC GTC GTG 4 8 

Met Ala Leu Ser Gin Asn Gin Ala Lys Phe Ser Lys Gly Phe Val Val 
-32 -30 -25 -20 

ATG ATT TGG GTA CTA TTC ATT GCT TGT GCT ATC ACT TCA ACT GAA GCT 96 
Met lie Trp Val Leu Phe lie Ala Cys Ala lie Thr Ser Thr Glu Ala 
-15 -10 -5 

AGT CTA ACG AAA TGC CAA CAG CTG CAG GCC TCG GCT AAC AGT GGT CTG 144 
Ser Leu Thr Lys Cys Gin Gin Leu Gin Ala Ser Ala Asn Ser Gly Leu 
15 10 15 



ATA GGT ACT TAT GTA CCA CAA TGC AAA GAA ACT GGA GAG TTT GAA GAA 192 
lie Gly Thr Tyr Val Pro Gin Cys Lys Glu Thr Gly Glu Phe Glu Glu 
20 25 30 

AAG CAA TGC TGG GGA TCG ACT GGT TAG TGT TGG TGT GTG GAT GAA GAT 240 
Lys Gin Cys Trp Gly Ser Thr Gly Tyr Cys Trp Cys Val Asp Glu Asp 
35 40 45 

GGA AAA GAG ATT CTA GGT ACA AAG ATC CGT GGA TCT CCA GAC TGC AGT 28 8 

Gly Lys Glu lie Leu Gly Thr Lys He Arg Gly Ser Pro Asp Cys Ser 
50 55 60 

CGC AGA AAA GCT GCC TTA ACA CTT TGC CAG ATG ATG CAA GCC ATC ATT 336 
Arg Arg Lys Ala Ala Leu Thr Leu Cys Gin Met Met Gin Ala He He 
65 70 75 80 

GTG AAT GTC CCT GGT TGG TGT GGA CCT CCA TCA TGT AAA GCT GAC GGC 3 84 

Val Asn Val Pro Gly Trp Cys Gly Pro Pro Ser Cys Lys Ala Asp Gly 
85 90 95 

AGT TTT GAC GAG GTT CAG TGC TGC GCA AGT AAT GGA GAA TGC TAC TGT 432 
Ser Phe Asp Glu Val Gin Cys Cys Ala Ser Asn Gly Glu Cys Tyr Cys 
100 105 110 

GTG GAT AAG AAA GGA AAA GAA CTT GAA GGC ACA AGA CAA CAG GGA AGG 480 
Val Asp Lys Lys Gly Lys Glu Leu Glu Gly Thr Arg Gin Gin Gly Arg 
115 120 125 

CCA ACC TGC GAA AGA CAC CTA AGC GAA TGC GAG GAG GCT CGT ATC AAG 52 8 

Pro Thr Cys Glu Arg His Leu Ser Glu Cys Glu Glu Ala Arg He Lys 
130 135 140 

GCA CAT TCA AAC AGT CTT CGT GTT GAG ATG TTC GTG CCA GAG TGT TTA 57 6 

Ala His Ser Asn Ser Leu Arg Val Glu Met Phe Val Pro Glu Cys Leu 
145 150 155 160 

GAA GAT GGA TCT TAC AAC CCT GTA CAG TGC TGG CCT AGC ACA GGA TAC 624 
Glu Asp Gly Ser Tyr Asn Pro Val Gin Cys Trp Pro Ser Thr Gly Tyr 
165 170 175 

TGT TGG TGC GTC GAT GAA GGA GGG GTA AAG GTT CCA GGT TCC GAC GTC 672 
Cys Trp Cys Val Asp Glu Gly Gly Val Lys Val Pro Gly Ser Asp Val 
180 185 190 

AGA TTC AAA CGT CCC ACA TGC TAA 696 
Arg Phe Lys Arg Pro Thr Cys 
195 



(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 231 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 



4 



(ii) MOLECULE TYPE, protein 
Ul) SEQUENCE DESCHIPTIOB: BEQ ID „o- 4- 
- Leu 3„ 0.„ 

lit - - - 3:„ 

3« .e„ L. ... 3. Le„ 0. 3,. 3er Oi. .e„ 

O^V v.. 3.„ L 3.. oi„ ..e 3^ 3:„ 

^ 30 



^" «^ - c= .3P 3.. 

45 

Gly Lys Glu He Leu Gly Thr Lv^ tt . 

^ ''^^ Gly ser Pro Asp Cys 



55 ~^ ""^ "''-^ ^sp cys Ser 

60 

" - - Le„ „e. „e. 3.. ... 

"° - - - L.. ... 

3« ... 3X„ V. 3. « 3er 3:. 3. J! 3.= 

110 

V- .SP L.e 3.. L.e 31„ Leu 3.„ 3.. 

■ - — - ne L.e 

140 

Ala His Ser Asn Qei^ 

Se. Leu v.l 31u Met P.e Val p„ 

160 

Glu Asp Gly Ser Tyr Asn Pro Val Gin r 

o Val Gin Cys Trp Pro Ser Thr Gly Tyr 

cys Trp Cys Val Asp Glu Gly Glv Val t w . 

190 

Arg Phe Lys Arg Pro Thr Cys 
195 ^ 



